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^ Abstract 

^> We consider a multiple-input-single-output (MISO) broadcast channel with mixed channel 

state information at the transmitter (CSIT) that consists of imperfect current CSIT and perfect 
outdated CSIT. Recent work by Kobayashi et al. presented a scheme which exploits both 
— ^ imperfect current CSIT and perfect outdated CSIT and achieves higher degrees of freedom 

■^ (DoF) than possible with only imperfect current CSIT or only outdated CSIT individually In 

this work, we further improve the achievable DoF in this setting by incorporating additional 
Q private messages, and provide a tight information theoretic DoF outer bound, thereby identifying 

^^ the DoF optimal use of mixed CSIT. The new result is stronger even in the original setting of 

only delayed CSIT, because it allows us to remove the restricting assumption of statistically 
►^ equivalent fading for all users. 



^ 



o 

en 1 Introduction 



CO Channel state information at transmitter (CSIT) is an important issue when designing communi- 

^ cation systems, and can be available in a variety of forms. Consider the following CSIT models for 

a two user MISO broadcast channel (BC) where the transmitter is equipped with two antennas, 
each receiver has one antenna and the channels vary in an i.i.d. fashion across time. 
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S^ 1. Perfect current CSIT: This is the setting where the transmitter knows the instantaneous 



^ 



channels perfectly at time t. 

2. Delayed CSIT: This is the setting where at time t, the transmitter knows the channels up 
to time t — 1 perfectly. 

3. Delayed CSIT and imperfect current CSIT: This is the setting where the transmitter 
has delayed CSIT and also has partial knowledge of the channels at time t. 

When the channel is changing very slowly, perfect current CSIT is a reasonable assumption. Under 
this model, zero forcing at the transmitter allows each user to achieve 1 DoF which is also its 
interference-free DoF. On the other hand, if the channel is changing very rapidly, delayed CSIT 
is a reasonable assumption. A channel that changes from symbol to symbol in an i.i.d. fashion, 
making any CSIT completely outdated (i.e., independent of the current channel state), naturally 
represents a worst case scenario. Surprisingly, even in this case, a DoF gain can be obtained due 
to retrospective interference alignment [2j, [3J. For many practical settings, however, at least some 
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Figure 1: DoF as a function of a 

imperfect knowledge of current channel state may be available in addition to the past channel 
state information. This can happen, e.g., due to temporal channel correlations when the CSIT 
feedback is not too delayed, or, e.g., due to the availability of a feedback channel with significant 
delay (providing outdated CSIT) in addition to the observations from a reverse channel (due to 
the two-way nature of communication) which may provide an (imperfect) estimate of the current 
channel state. In this work we explore how the transmitter can optimally use this mixed CSIT, 
from a DoF perspective. 



1.1 Overview of results 

We consider a two user MISO broadcast channel where the transmitter has two antennas. It is 
assumed that the transmitter has perfect delayed CSI and imperfect current CSI. We are interested 
in characterizing the DoF of this channel. To better understand the results, we present the channel 
model in its simplest form as follows. The rationale for this simple form will be explained in detail 
in Section [2m 



yiit) = \^ xi{t) ^ h{t)^P^ X2{t) + ziit) 
y2{t) = y/P X2{t) + g{t)y/P^ xi{t) + Z2{t) 



(1) 
(2) 



Here, during channel use t, yi{t) is the received signal at receiver z, Zi{t) is the additive white 
Gaussian noise with zero mean and finite variance at receiver z, xi{t) is the projection of the trans- 
mitted signal along the dimension that is orthogonal to the estimated current channel of receiver 
2 while X2{t) is the projection of the transmitted signal along the dimension that is orthogonal to 
the estimated current channel of receiver 1. Essentially, by a change of basis operation (no loss 
of generality for DoF), the effective transmit antennas are aligned with the zero forcing signaling 
dimensions based on the imperfect current CSIT. Since the current channel is not known perfectly, 
perfect zero-forcing of signals is not possible, so that h{t) and g{t) are the channels for the non-zero- 
forced part of the signals. They are modeled as unknown channel coefficients at time t, and will 
be known to the transmitter after a significant delay (outdated CSIT). Due to imperfect current 
CSIT, P^ represents the residual signal power after zero forcing, whose strength is measured by 



a, a parameter that ranges between zero and one. When o: = 0, it corresponds to the case when 
current CSIT is as good as perfect, because the residual signal power after zero forcing is at the 
noise floor level and has no impact on DoF. In this case, simply zero-forcing allows each user to 
achieve 1 DoF, for a total of 2 DoF which is also the maximum DoF of the channel even with 
perfect current CSIT. On the other hand, if a = 1, it corresponds to the case when there is no 
current CSIT, and zero-forcing is not able to reduce the signal strength. In this case, Maddah-Ali 
and Tse proposed an interference alignment scheme (which will be referred to as the MAT scheme) 
to achieve the optimal DoF of | [2J . 

For < a < 1, zero-forcing, which only uses current CSI, achieves 2 — 2a DoF, while the MAT 
scheme, which only uses delayed CSI, achieves 4/3 DoF regardless of a. Recently, in [Ij, Kobayashi 
et al. proposed an interesting scheme which exploits both current and delayed CSI to achieve | , ^^ 
DoFj^ [Ij. The achievable DoF curve of [Ij is shown in Fig. [I] In this work, we build our scheme 
upon that of Kobayashi et al. and further improve it to achieve 2 — ^a DoF, which represents a 
straight line that goes from a sum DoF value of 2 when a = to a sum DoF value of 4/3 when 
q; = 1 as shown in Fig. [IJ The new achievable scheme provides the largest DoF gain of 4/9 over 
best of ZF and MAT schemes at a = 1/3. 

We also derive a DoF outer bound for the mixed CSIT setting, where no non-trivial outer 
bound was previously available, to show that 2 — |q: DoF is in fact optimal. In addition to being 
tight, the outer bound possesses another robust feature — it does not force the i.i.d. assumption 
across users, i.e., the users can have different channel distributions. In contrast, the assumption 
of statistically equivalent fading for all users is used for the original outer bound derived for the 
MISO broadcast channel with only delayed CSIT [2]. Therefore, apparently even in the original 
setting of only delayed CSIT, our result further strengthens the result of [2]. 

1.2 The achievable schemes: key ideas 

In this section, we provide an intuitive description of our achievable scheme. Since our scheme is 
built upon that of [IJ (which is built upon 12J), we first review the scheme in ^Lj. 

1.2.1 Achievable scheme of Kobayashi et al. fl] 

Following the principle of MAT scheme, in the scheme of Kobayashi et al. also, the interference 
initially seen at each receiver is multicast to both receivers in subsequent transmission phases, such 
that the undesired receiver can cancel the previously seen interference while the desired receiver 
can use this additional observation to resolve its desired symbols. However, unlike MAT scheme 
which assumes no knowledge of current channel state, Kobayashi et al. also simultaneously take 
advantage of the partial zero forcing capability provided by the imperfect current CSIT. Specifically, 
the scheme consists of three phases. 

In the first phase, which consists of one time slot, the transmitter simultaneously sends four 
symbols ai, a2, 6i, 62, where ai, a2 are intended for receiver 1 and 61, 62 are intended for receiver 2. 
The symbols ai and 61 exploit the partial zero forcing capability, as they are sent along the directions 
that are orthogonal to the imperfectly known current channel states of receiver 2 and 1, respectively. 
Due to partial zero- forcing, ai,6i are received with full power P at the desired receivers, but only 
at power P^ at the undesired receivers. Having exhausted the partial zero-forcing capability with 



^The parameter a used in this paper is different from that used in 1 . As will be explained in the following part 
of the paper, suppose we denote the a used in 1 as a . Then a in this paper is related to a through a = 1 — a . 



ai 



(12 



b2 



pa 



Figure 2: Power levels of symbols at receiver 1 in the first time slot 



ai,6i, the transmitter simultaneously sends a2 and 62 along generic directions to guarantee linear 
independence with ai,6i. However, a2,62 are allocated less poweijj so that they are received at 
power level P^ at both receivers. Note that this is the same power level as ai and 61 at the 
undesired receivers. As shown in Fig. [2j bi and 62 are received at the same power level P^ at 
receiver 1, as is a2. 

In the next phase, the transmitter wants to deliver the linear combination of 61 and 62 seen by 
receiver 1 and that of ai and a2 seen by receiver 2 to both receivers. A key novel idea of [1] is to 
quantize the interference and then multicast it to both receivers digitally instead of directly sending 
the analog interference signal as the MAT scheme. Now consider quantizing the interference given 
by the linear combination of 61 and 62 at receiver 1. As shown in Fig. [2J since 61,62 have power 
P^ at receiver 1, the linear combination can be quantized to within unit approximation error using 
approximately alogP bits. These bits will be coded using a multicast code and then be sent to 
both receivers. Since the channel has 1 DoF for multicast to both receivers, i.e., rate logP for 
sending common information, the number of time slots needed to send these bits is approximately 
a. 

Similarly, in the third phase the transmitter will use a time slots to multicast the quantized 
linear combination of ai and a2 seen by receiver 2 in the first time slot. As a consequence, combining 
all three phases, after 1 + 2a time slots each user can decode two desired symbols to achieve - — — 
DoF. 



l+2a 



1.3 New (optimal) Scheme: Improving upon the achievable scheme of p!] 

Our scheme also consists of three phases. The main reason for the improvement is that while the 
scheme of Kobayashi et al. uses the partial zero forcing capability of the channel only in Phase 1, 
our scheme uses the partial zero forcing capability in every transmission. Phase 1 of our scheme is 
the same as that of Kobayashi et al. in [Ij. For the second phase, just like Kobayashi et al., we also 
multicast the common information (denoted by ci in Fig. [s]) which is simply the quantization of the 
linear combination of 61 and 62 seen by receiver 1. However, unlike the scheme of Kobayashi et al., 
in addition to multicasting this common information, we also simultaneously send two new private 
messages, one for each receiver using the partial zero forcing capability of the channel. Denote 
these additional symbols for receiver 1 and 2 as as and 63, respectively. Then as and 63 are sent 
from xi and X2, respectively, in (fl]) and S, i.e., they are sent along the imperfectly known zero- 
forcing directions. Moreover, we allocate only enough power to as,bs such that they are received 
at noise floor level at the unintended receivers. From ^ and Q, it can be seen the allocated 

^According to the model (llj), ([2|), the power allocated to a2, 62 is simply P^~^ so that the dominant received power 
term from these symbols is P" at both receivers, and the choice of generic directions simply implies transmission 
from both antennas with different linear weights. 



power is P ^. With this power allocation, they will be received at the power level P^ ^ at the 
desired receivers. The power levels of symbols at two receivers are shown in Fig. [3] Each receiver 
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(a) Power levels of symbols at receiver 1 
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(b) Power levels of symbols at receiver 2 



Figure 3: Our scheme: The power levels of symbols at receiver 1 and 2 in the second time slot 

will decode the common information ci first, subtract it out and then decode the private message. 
Due to the private message, the noise level for the common message is raised to P^~^. Thus, the 
common message can achieve a rate approximately log(P/P^~^) = a log P. Each private message 
can achieve a rate approximately (1 — a) log P. Since the common message contains aXogP bits, 
only one time slot is need to deliver it. 

The third phase is similar to second phase, i.e., we send the common information, C2, carrying 
the quantized linear combination of ai and a2 as seen by receiver 2 in the first time slot, and two 
private messages, a4, 64, one for each receiver. The remaining details are the same as phase 2. 

As a consequence, over 3 time slots, each user can decode four desired symbols to achieve 

We also provide an information theoretic outer bound that proves this DoF value is optimal. 
The DoF value can be written as 2(|q: + 1 — a) and can be interpreted as ''each user can achieve 

1 a DoF due to retrospective interference alignment and 1 — a DoF due to zero-forcing.^^ 

2 System Model 

We follow a similar model to Kobayashi et al. [Ij, except that we assume i.i.d. temporal variationsrl 
The channel input-output relationship at time t is given by 



yi(t) = ht(i)x(f) + zi(t) 

y2{t) = gt(t)x(f) + Z2(t) 



(3) 
(4) 



where yi{t) is the observed signal at receiver z, x(t) = [xi{t) X2{t)]^ is the 2x1 input signal 
satisfying the power constraint £^[||x(t)|p] < P and Zi{t) ^ CA/'(0, 1) is the circularly symmetric 
complex additive white Gaussian noise (AWGN). h(t) and g(t) are two 2x1 channel vectors to 
receiver 1 and 2, respectively. In addition, we assume that 



h(t): 

sit) 



hit) + Ht) 

iit) + g{t) 



(5) 
(6) 



"^The assumption of i.i.d. temporal variations is significant only for the DoF outer bound. For achievability, our 
scheme works perfectly with the same assumptions as Kobayashi et al. as well. 



where h(t) and g(t) are the estimated channels while h(t) and g(t) are the estimation errors. We 
assume that h(t), h(t), g(t), g(t) are drawn from continuous distributions, independent of each other 
(although not necessarily identically distributed), and vary in an i.i.d. fashion in time. 

The estimated channels and estimation errors are assumed to be zero mean with covariance 
matrices (1 — cr^)I and a^I (cr^ < 1), respectively. Receivers know the instantaneous channels 
perfectly. Regarding the channel state information at the transmitter, it is assumed that the 
transmitter has 1) perfect delayed CSI and 2) imperfect current CSI, i.e., at time t the transmitter 
knows perfectly h(l), • • • , h(t — 1) and g(l), • • • , g(t — 1) as well as h(t) and g(t). We define 

logP ^ ^ 

Essentially, the parameter a^ measures the quality of the current channel estimation. If a' = 0, 
then it corresponds to the case when there is no current CSI. If a' > 1, then it corresponds to the 
case that the current CSI is as good as perfect (for DoF). Note that in [Ij, the definition of a' is 
slightly different from our definition where a' is defined as limp^oo ~\o^p • 

There are two independent messages, one for each receiver. We denote the size of message 
Wk as \Wk\, k E {1,2}. For the codewords spanning n channel uses, the rates R^ = log(|VF/e|)/^ 
are achievable if the probability of error for both messages can be simultaneously made arbitrarily 
small by choosing an appropriately large n. The sum capacity C^{P) is the maximum achievable 
sum rate. The number of degrees of freedom is defined as 

d = lim ^^ (8) 

P^oc logP ^ ^ 

Note that there is a subtle thematic distinction between this work and that of Kobayashi et al. 
in that our focus is limited to optimal use of current and outdated channel information whereas 
Kobayashi et al. also incorporate temporal correlations. The thematic distinction leads to slightly 
different channel models. While Kobayashi et al. deal with additional complexities of temporal 
correlations and non-ergodic settings, we are able to ignore temporal correlations (i.i.d. fading 
in time) while capturing the essential aspect of availability of both current and outdated channel 
knowledge. While our model is simpler than p[J in ignoring temporal correlations, and closer to the 
original model of [2J , we expect that from a DoF perspective the two models will produce equivalent 
results and shed similar insights into the same essential question that motivates the two works. 

Next we justify the equivalence of the model presented above (along the lines of Kobayashi et 
al.) and the simpler model in (fl]), S used for the overview in the introduction section. 

2.1 An equivalent model 

We can perform a sequence of invertible operations at transmitter and receivers to convert the 
channel to its simplest form. Since invertible transformations do not affect the DoF of the channel, 
there is no loss of generality, i.e., the resulting channel has the same DoF as the original channel. 
First, we perform an invertible linear transformation at the transmitter. This is done by multiplying 
a 2 X 2 invertible matrix A(t) = [v(t) u(t)] to the transmitted signal, i.e., A(t)x(t) where x(t) = 
[xi{t) X2{t)]^ is the original channel input vector. v(t) and u(t) are unit norm vectors chosen in 
a manner that they are orthogonal to g(t) and li(t), respectively. As a consequence, the received 
signals at receiver 1 and 2 become 

yi{t) = h\t)^r{t)xl{t) + h\t)u{t)x2{t) + Zl{t) (9) 

y2{t) = g\t)u{t)x2{t) + g\t)^r{t)xl{t) + Z2{t) (10) 



Then receiver i can normalize the channel coefficient of Xi{t) to unity such that the channel becomes 

yi{t) ht(f)u(f) zijt) 

= ^nt) + ,iu\..u\ ^-2[t) + i.iu\..u\ (11) 



ht(t)v(t) ^'^ ht(t)v(t)^'^ ht(i)v(t) 

■^ V ' "^ V ' 

y'lit) 4W 

?/2(*) ^ ... ^ g^(t)v(t) ^2(t) .. .. 

^ ^2(ij + t/,N /,N ^l(^) + t/,N /,N (12) 



gt(i)u(t) ^^ ^ gt(t)u(t) ^^ ^ gt(t)u(t) 

2/2 W 4(^) 



Now let us define h(t) = h(t)/vP^^ and g(t) = g(t)/VP~^\ both of which have covariance matrix 
I. In addition, we normalize the transmit power to unity by absorbing it into the channel coefficients 
and defining Xi{t) = \fPx[{t). Then the channel input-output relationship can be written as 

y[{t) = VPxUt) + VP^=^|^4(t) + ^l(t) (13) 

^ V ' 

h{t) 

y',{t) = ^:,'^(t)^^^^^^^:,'^{t) + z',{t) (14) 

^ V ' 

9{t) 

where h{t) and g{t) have finite variances independent of P and they are only known to the trans- 
mitter with one time delay. By setting a = 1 — a^ and with a little bit of abuse of notations, we 
end up with the following simple channel model: 

yi{t) = VPxi{t) ^ VP^h{t)x2{t) + zi{t) (15) 

y2{t) = ^/Px2{t) + ^/P^g{t)xl{t) + Z2{t) (16) 

Since a' is defined to be greater than or equal to zero and a = 1 — a\ it follows that a < 1. 

3 Results 

When q: < 0, the problem becomes trivial. It can be easily seen that simply zero-forcing at the 
transmitter can achieve 2 DoF which is also the maximum achievable DoF. Therefore, we only 
consider the case when < a < 1 in this section. The main result of this paper is presented in the 
following theorem. 

Theorem 1 For the MISO broadcast channel with mixed CSIT defined in Section^ 

2 

d = 2--a, 0<a<l. (17) 

o 

This theorem establishes both the achievable DoF and its optimality. In terms of outer bounds, this 
is the first non-trivial outer bound for the mixed CSIT setting. However, the result is interesting 
even in the original setting of [2J with only delayed CSIT, i.e., with no current CSIT. This is because 
our outer bound does not require that the two users are statistically equivalent, i.e., the users can 
have different fading distributions. In contrast, the outer bounds for BC with delayed CSIT, e.g., 
in [2], require that two users are statistically equivalent. Therefore, evidently we have a stronger 
result in the original setting as well. 



4 Outer bounds 

In this section, we provide the outer bound proof. We start first with the original setting of [2j 
with only delayed CSIT and provide an alternate proof that does not require statistically equivalent 
fading for the two users. Our outer bound follows a compound channel approach. 

4.1 Outer bound for the delayed CSIT setting 

Consider the following BC with delayed CSIT as in [2j. 

yi = htx + zi (18) 

y2 = g^yi + Z2 (19) 

For simplicity we omit the time index. Recall that we assume the channels are i.i.d. in time and 
independent across users but not necessarily identically distributed across users. Now suppose we 
provide yi to User 2 so that it has both yi, ^2, which makes the channel physically degraded. For a 
physically degraded BC without memory (this requirement of memoryless channels is the primary 
reason that we restrict the model to i.i.d. fading in time), feedback does not increase capacity, so 
we can eliminate the delayed CSIT feedback for this new channel. Now, let us impose a compound 
setting on h and g, which is consistent with the outer bound argument, i.e. the compound setting 
does not decrease the capacity of the original channel. To see this, suppose we first introduce 
another pair of receivers, one for each user, that are statistically equivalent to the original receivers 
and require the same messages. Since the additional receivers have the same decoding capabilities as 
the original receivers, the capacity region is not decreased. Now we provide full channel knowledge 
to the transmitter, which also cannot reduce capacity (this step is not necessary, but it shows the 
strength of the outer bound). This puts us into a two state compound BC setting. Since at no 
point did we reduce the capacity, the outer bound for this compound setting is also an outer bound 
for the original channel. 

So now we have two more fictional outputs: 

y[ = h^tx + zl 
y'2 = g'^x + 4 

Note that in the compound setting the transmitter knows that the channel vectors can be either 
h, g or h',g^ Essentially now we have two BCs controlled by the same inputs. In the first BC, 
User 1 sees yi and wants message VFi, and User 2 sees ^1, ^2 and wants message W2. In the second 
BC, the first user sees y[ and wants message Wi and the second user sees y[,y2 and wants message 
W2. To derive the outer bound, we start with the first BC. From Fano's inequality, 

nRi < I{Wr,y'l) + o{n) (20) 

= h{y^) - Hy'llWi) + oin) (21) 

< nlog{P) - hiv'^lWi) + n o{log{P)) + o{n) (22) 

Proceeding similarly with y[ instead of yi, we have the bound: 

nRi < nlog(P) - Hy'^'lWi) + n o(log(P)) + o{n) (23) 

Adding the two we have 

2nRi < 2nlog{P)-h{y'l,yT\Wi) + no{log{P)) + o{n) (24) 



Note that from y'^ and y'{^^ it is possible to invert the channel and to construct x within bounded 
variance noise distortion. Therefore, h{yi^y'i\Wi) = ni?2 + n o(log(P)) + o(n). Using this obser- 
vation, we proceed as follows. 

nRi + nR2 = I(Wi, W2; y^yf) + « o(log(P)) + o(n) (25) 

nR2 = I(W2;yr,yr|Wi) + no(log(P)) + o(n) (26) 

= h{y^,yT\Wr) - h{y^,yT\Wr,W2)+n o(log(P)) + o(n) 

^ V ' 

n o(log(F)) 

= %^yr|Wi) + no(log(P)) + o(n) (27) 



Adding (24) and (27), and writing it in DoF terms: 

2rfi + d2 < 2 (28) 

By symmetry, we can repeat the whole procedure by creating a degraded channel in the other 
direction (User 2 is degraded) to obtain the bound 

2^2 + rfi < 2 (29) 

Adding the two bounds, we have the final DoF outer bound 

di + d2< -. (30) 

Evidently, the outer bound applies even if the channel uncertainty at the transmitter is reduced 
to a one bit uncertainty representing a choice between only two independent realizations of each 
user's channel. Note that the bound only requires linear independence between the two realizations. 
Since the channels are drawn from a continuous distribution, this is true with probability 1. 

4.2 Outer bound for the mixed CSIT setting 



We will derive the outer bound based on the equivalent simple model given by equation (15). 
Similar to the outer bound for the delayed CSIT setting, we make the channel physically degraded 
by providing yi to User 2. Then we can eliminate the delayed CSIT feedback for this new channel 
since feedback does not increase capacity for a physically degraded memoryless BC. Let us impose 
a compound setting on /i, g, which is consistent with the outer bound argument. Now we have two 
more fictional outputs: 



y[ 



^ = ^/PXl + ^/P^h'x2 + Z[ 

'2 = \/Px2 + VP^^'xi + 4 



2/2 

Note that in the compound setting the transmitter knows that the cross-coefficients take values 
either h^g or h' ^g' . Essentially now we have two BCs controlled by the same inputs. To derive the 
outer bound, we start with the first BC. From Fano's inequality, 

nRi < liWi-y'D + oin) (31) 

< n\og{P) - h{y'^\Wi) + n o{\og{P)) + o{n) (32) 



Proceeding similarly with y[ instead of yi, we have the bound: 

nRi < nlog(P) - hiy'^'lWi) + n o(log(P)) + o{n) (33) 

Adding the two we have 

2ni?i < 2nlog(P)-%^|Wi)-%f|VFi) + no(log(P)) + o(n) (34) 

< 2nlog(P)-%^yf|Wi) + no(log(P)) + o(n) (35) 

To bound the remaining entropy term, next we want to show that 

ni?2 < Kvl, y^\Wx) + n(\ - a) log(P) + n o(log(P)) + o{n) (36) 

Note that from yx^y'^ we can do a change of basis to obtain 

y\,new = VPxx + Zx (37) 

y2,new = \/Px2 + 2^2 + ^ (38) 

where Zi -^ CA/'(0, 0(1)), and most importantly z ^ CA/'(0, P^""^). 

So, from yi,y[^ and z it is possible to obtain xi + 2:1, X2 + 2:2, i.e., it is possible to invert the 
channel within bounded variance noise distortion. Using this observation, we proceed as follows. 

nRi + nR2 = /(T^i, 1^2;^", yf, ^") + n o(log(P)) + o(n) (39) 

nR2 = IiW2;y'l,yT,z''\Wi) + noilogiP)) + o{n) (40) 

= IiW2;y'l,yT\Wi) + IiW2;z^\y'l,y[^,Wi) + noilog{P)) + o{n) (41) 

= h{y^, yT\W,) - h{y^, yT\Wi, W2) + /^(^"|y^ yf, W^) 

n o(log(P)) <h{z^) 

- /i(z'^|y^ yf , Wi,W2) + n o(log(P)) + o{n) 

< h{y^, yT\Wi) + h{z^) - h{z-\{z2 + 2)") +n o(log(P)) + o(n) (42) 

n o(log(P)) 

= %^yf|Wl) + n(l-a)log(P) + no(log(P)) + o(n) (43) 



Thus we have found a bound for the remaining entropy term. Adding ( |35| ) and ( |43| ), we have 

2nPi + nP2 < 2n log(P) + n(l - a) log(P) + n o(log(P)) + o(n) (44) 

Writing the equation in DoF terms, we have 

2di + d2 < 3- a (45) 

By symmetry we can repeat the whole procedure by creating a degraded channel in the other 
direction (User 2 is degraded) to obtain the bound: 

rfi + 2^2 < 3- a (46) 

Adding the two bounds, we have the final DoF outer bound 

di + d2 < 2 --a (47) 

o 

which represents a straight line that goes from a sum DoF value of di + (i2 = 2 when o: = to a 
sum DoF value of di + ^2 = 4/3 when a = 1. 

10 



5 Achievability 

In this section we provide the achievabihty proof for Theorem [l] The achievable scheme consists 
of three phases. Since the achievable schemes share many similarities with that derived in [Ij, we 
will use the equivalent model given by ([s]) for ease of exposition. 

Phase 1 

Phase 1 consists of one time slot and the transmitted signal is 

x(l) = viai + V2a2 + ui6i + U262 (48) 

where a^ and bi are symbols intended for user 1 and 2, respectively. Two independent Gaussian 
codebooks are used for [ai a2] and [61 62] with diagonal covariance matrices. The power of ai and 
61 is (P — P^~^ ~^)/2 where e is a positive number that can be chosen arbitrarily small while the 
power of a2 and 62 is pi~^ ~^/2 such that £^[||x(l)|p] < P. v^ and u^ are beamforming vectors with 
unit norm. We will design vi and Ui such that they are orthogonal to g(l) and h(l), respectively, 
i.e. 

gt(l)vi = (49) 

ht(l)ui = (50) 

V2 and U2 are chosen randomly, such that they are linearly independent with vi and Ui, respectively, 
with probability one. 

Now consider the received signal at receiver 1. 

^i(l) = ht(l)x(l) + zi(l) (51) 

= ht(l)viai + ht(l)v2a2 + (ht(l)ui6i + ht(l)u262) +zi(l) (52) 

" V ' 

m 
Similarly, the received signal at receiver 2 is 

^2(1) = gt(l)x(l) + Z2(l) (53) 

= g^(l)ui6i + gt(l)u262 + fg^(l)viai + gt(l)v2a2) +^2(1) (54) 



The power level of each symbol at each receiver is shown in Figjlj As we can see, due to partial 

zero- forcing of symbol ai and 61 at receiver 2 and receiver 1, respectively, they are received at 

power level of P^~^ although they are sent with power level of 0{P). On the other hand, a2 and 

62 cannot be partially zero-forced at the unintended receivers and we allocate the power such that 

they are received at the same power level of ai and 61 at receiver 2 and 1, respectively. 

As in [Ij, we will quantize the real and imaginary parts of rjk separately using a scalar truncated 

i+c 
uniform quantizer with unit step and truncation value r] — P 2 a, for some C > 0- Let us denote 

Vk = Vk + Ak (55) 

where fjk is the quantized value while A/^ is the quantization error. And fjk contains 

Rk = 2 log(2 \f]'])^2 + {l + C-a) log P bits (56) 
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(a) Received signal at Receiver 1 (b) Received signal at Receiver 2 

Figure 4: The power levels of symbols at two receivers in time slot 1 



f]k viewed as a message containing Rj^ bits will be encoded using a Gaussian codebook with code- 
words denoted by Ck- Then Ck will be sent to both receivers as a common information in the 
following time slots. Note that essentially phase 1 is the same as phase 1 of the scheme proposed 

in [IJ. 

Phase 2 

The goal of phase 2 is to deliver the common information ci to both receivers and at the same time 
to send two private messages each for one receiver. Suppose private messages for receiver 1 and 2 
are encoded using independent Gaussian codes with codewords denoted as as and 63, respectively. 
Consider the following transmitted signal (for simplicity we omit the time index) 



X = wci + vsas + U363 



(57) 



where w, V3 and U3 are beamforming vectors with unit norm, w is chosen randomly while V3 and 
U3 are chosen such that they are orthogonal to g and h, respectively. We set the powers of as, 63 
and ci as P^ /2, P^ /2 and P — P^ , respectively, such that £"[11x1^] = P. Then the received signal 
at receiver 1 is 



yi = h'^'x + zi 

= h' wci + h' V3a3 + h' U363 + 



zi. 



(58) 
(59) 



Now consider the power level of each symbol at receiver 1. It can be easily seen that £ ^[|h"^wcip] 



and £'[|hW3a3p] are on the order of P and P^ , respectively, as shown in Fig. 5(a 



For 63, it 

can be seen that £^[|h'^U363p] is 0(1), i.e. at the noise floor level. Similarly, the received signal at 
receiver 2 is 



?/2 = g'''x + Z2 



g^WCi + g'''u363 + g'''v3a3 + Z2 



6-U 



(60) 
(61) 



Again, as is received at the noise floor level while the power levels of ci and 63 are P and P^ , 
respectively, as shown in Fig. |5(b)[ Both receivers will flrst decode ci by treating other signals as 
noise. The achievable rate for ci is 



i?ci = min {/(ci; yi, h, g), /(ci; ^2, h, g)} 



(62) 
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(a) Received signal at Receiver 1 (b) Received signal at Receiver 2 

Figure 5: The power levels of symbols at two receivers in phase 2 

Next, we calculate these two mutual information terms. First, 

|htw|2(P-P«') 



I(ci;yi,h,g) 



E 



E 



log 1 + 



l + |htv3|2^ + |gtu3P^ 



log 1 + 



|htw|2(P-P«') 



l + |htv3P^ + |P=^gtu3P^ 



= E 



E 



log 1 + 



|htw|2(P-P"') 



E 



log 



log 



l + |htv3|2^ + |gtu3|2/2 

|htw|2(P-P«') 



'2|htw|2(pi-"'-l)" 



|htv3|2 

{I - a')\ogP + o{\ogP) 



+ o(logP) 
+ o(logP) 



where g = g/P 2 with covariance matrix I. By symmetry, we also have 

/(ci;y2,h,g) = (l-a')logP + o(logP) 



(63) 

(64) 

(65) 

(66) 

(67) 
(68) 

(69) 



Therefore, Pci = (1 - "') logP + o(logP). Note that ci contains 2 + [l + C, - a') logP bits. The 
number of time slots needed to send these bits is 



t2 



2 + (l + C-a')logP 



+ 



(i + C-«') 



{l-a')\ogP + o{\ogP) {I - a')\ogP + o{\ogP) i q./ | QJ^^^^p) ' 



(70) 



Since C, can be made arbitrarily small, ^2 ^ 1 as P ^ oc. In other words, at high SNR, phase 2 
only consists of one time slot. 

After decoding ci at receiver 1, since the channels are known to the receiver, it can remove ci 
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from the received signal. Then it can decode as which achieves the fohowing rate: 



R, 



as 



I(a3;y2,h,g|ci) 



= E 



E 



log 1 + 



log 1 + 






l + |gtvi|2/2 
= a' log P + o(log P) 
By symmetry, at receiver 2, 63 can achieve a rate 

^63 =-f(^3;y2,h,g|ci) = a'logP + o(logP) 

Phase 3 



(71) 
(72) 

(73) 
(74) 

(75) 



Phase 3 is very similar to Phase 2. In phase 3, the common information C2 will be sent to both 
receivers and again at the same time one private message will be sent to each receiver. Denote the 
codewords of private message for receiver 1 and 2 as a^ and 64, respectively. Then the transmitted 
signal is 

X = WC2 + V4a4 + U464 (76) 

where V4 and U4 with unit norm are chosen such that they are orthogonal to g and h, respectively. 
We set the powers of a^, 64 and C2 as P^ /2, P^ /2 and P — P^ , respectively. With the same 
analysis as phase 2, both a4 and 64 can achieve a' DoF and C2 can be delivered using ts time slots 
where ts — t2. 

After decoding ci and C2, both receivers know fji and 7)2. Now receiver 1 will use 7)1, 7)2 and 
7/1(1) to construct an effective 2x2 MIMO channel with input ai and a2 as follows. 



2/1(1) -fji 



ht(i) 
g^i) 



[ Vi V2 



ai 
a2 



+ 



^i(l) + Ai 

-A2 



(77) 



Similarly, receiver 2 will use 571, rj2 and ^2(1) to construct an effective MIMO channel with input 
61 and 62 as follows. 



2/2(1)-% 

m 



ht(i) 



[ Ui U2 



b2 



+ 



^2(1) + A2 

-Ai 



(78) 



As shown in [Ij, the probability of error Pg of decoding ai, a2, bi and 62 will go to zero as P ^ oc. 
And the rates of [ai a2] and [61 62] are (2 — Q:')logP + o(logP). Now, using random coding 
arguments, it can be seen for [ai a2] and [61 62], the reliable rates are approximately (1 — Pe)((2 — 
a^) logP + o(logP)). Since Pg will go to zero when P ^ oc, [ai a2] and [61 62] can achieve 2 — a^ 
DoF. 

Now we can calculate the achievable DoF using this scheme. When P ^ oc, phase 2 and 3 each 
consists of 1 time slot. Since phase 1 consists of 1 time slot, a total of 3 time slots are used. Over 
these 3 time slots, four symbols are delivered to each receiver. Therefore, 



d = 



2(2 - a' + a' + 0^0 _ 2(2 + ^0 



(79) 
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Replacing ol with 1 — a, we have 

2 

d = 2 - -a. (80) 

6 Conclusion 

We consider a two user MISO BC with two antennas at the transmitter. It is assumed the transmit- 
ter has delayed CSI and imperfect current CSI. We characterize the optimal DoF of this channel by 
providing both achievability and outer bounds. The results reveal the DoF optimal use of outdated 
and current CSIT. 
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